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Abstract 

Background: Haemocyanin is the respiratory protein of most of the Mollusca. In cephalopods and gastropods at 
least two distinct isoforms are differentially expressed. However, their physiological purpose is unknown. For the 
common cuttlefish Sepia officinalis, three isoforms are known so far, whereas for only two of them the complete 
mRNA sequences are available. In this study, we sequenced the complete mRNA of the third haemocyanin isoform 
and measured the relative expression of all three isoforms during embryogenesis to reveal a potential ontogenetic 
relevance. 

Results: The cDNA of isoform 3 clearly correlates to the known Sepia officinalis haemocyanin subunits consisting 
of eight functional units and an internal duplicated functional unit d. Our molecular phylogenetic analyses reveal 
the third isoform representing a potentially ancestral haemocyanin isoform, and the analyses of the expression 
of haemocyanin type 3 reveal that haemocyanin type 3 only can be observed within eggs and during early 
development. Isoforms 1 and 2 are absent at these stages. After hatching, isoform 3 is downregulated, and 
isoform 1 and 2 are upregulated. 

Conclusions: Our study clearly shows an embryonic relevance of the third isoform, which will be further discussed 
in the light of the changes in the physiological function of haemocyanin during ontogeny. Taken together with 
the fact that it could also be the isoform closest related to the common ancestor of cuttlefish haemocyanin, 
the phylogeny of cuttlefish haemocyanin may be recapitulated during its ontogeny. 
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Background 

Haemocyanin is the respiratory protein of arthropods 
and molluscs. This copper-containing pigment occurs 
freely dissolved in the haemolymph of these animals and 
is coloured blue when oxygenated. The haemocyanins of 
both mentioned phyla have a copper type-3 oxygen- 
binding site, but otherwise show structural differences 
and evolved separately [1-3]. The basal quaternary struc- 
ture of molluscan haemocyanins is a cylindrical decamer 
composed of -350-400 kDa large subunits (SU) [4]. In 
Polyplacophora and Cephalopoda, only decamers can be 
observed floating in the haemolymph [5,6]. In Bivalvia 
and Gastropoda, they are also able to form didecamers 
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and multidecamers [7,8]. The single subunits usually 
consist of 7-8 tandemly arranged functional units (FU) 
of -50 kDa each. Every FU exhibits one oxygen-binding 
site [1]. These paralogous FUs evolved by subsequent 
gene/exon duplications and fusions [9,10]. The subunits 
of cephalopod haemocyanins are characterised by the lack 
of the most C-terminal FUh [6,10]. The 350 kDa hae- 
mocyanin subunits of Tetrabranchiata and Octobrachia 
consist of 7 FUs (a-g), contrary to the 8 FUs (-400 kDA) 
in Decabrachia [1,11]. For the common cuttlefish (Sepia 
officinalis) and Spirula (Spirula spirula), it has been shown 
that a duplication of FUd led to the 8 FUs (a-b-c-d-d'-e-f-g; 
[12,13]). 

Besides the duplications of single FUs, duplications of 
the complete haemocyanin gene occurred as well. This 
happened several times independently in different groups 
and resulted in different isoforms of the protein [4]. 
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Gastropods and bivalves can feature one or two isoforms 
[1,8,14-20], while in cephalopods probably only the 
tetrabranchiate Nautilus pompilius is restricted to a sin- 
gle haemocyanin [6]. The recent coleoid cephalopods 
were all found to express at least two different isoforms 
of the respiratory protein (e.g., Sepia officinalis, DeGeest 
et al, unpubl; Allotheuthis media, Oellermann et al, 
unpubl; E. dofleini [9]). So far, the complete mRNA 
sequences of haemocyanin isoform 1 (SoHl [GenBank: 
DQ388569]) and isoform 2 (SoH2 [GenBank:DQ388570]) 
have been identified in Sepia officinalis. Next to these two 
known isoforms there is great evidence for a third one in 
the cuttlefish (SoH3; [21,22]), whereof one fragment 
of ~ 1,000 bp has already been sequenced [GenBank: 
JN392726]. The existence of these isoforms raises 
questions about their evolution referring to origin and 
structural changes as well as to their physiological 
relevance. 

Due to its moderate evolutionary rate, haemocyanin has 
been used to estimate the divergence time of different 
groups among Mollusca, e.g., the most recent common 
ancestor of cephalopods, Dibranchiata and Decabrachia 
was calculated [13]. Additionally, the duplication of the 
S. officinalis isoforms could be dated to -90 million 
years ago (Mya), whereas the isoforms of E. dofleini split 
much more recently. In both isoforms of S. officinalis— %.s 
well as in S. spirula— the duplicated FUd is present 
[11,13]. This points to a duplication event of FUd earlier 
than the split of the isoforms. Yet with no complete 
sequence of the third isoform, no further implications 
concerning the origin and evolutionary development of 
that isoform can be drawn. 

However, all isoforms are expressed, and SoHl and 
SoH2 even display different theoretical isoelectric points 
(in silico: SoHl: pi = 5.79, SoH2: pi = 5.85), which in turn 
could affect the oxygen affinity [21,22]. The existence of 
different isoforms might therefore serve a physiological 
purpose. Indeed, it has already been shown that some 
vertebrates and invertebrates can change the composition 
or concentration of their respiratory protein in response 
to changing environmental parameters such as tempe- 
rature and oxygen partial pressure P0 2 (Gadus morhua 
[23], Daphnia magna and D. pulex [24,25]; Triops longi- 
caudatus [26], Artemia franciscana [27]). Moreover, 
cephalopods display the highest metabolic rates among 
invertebrates and therefore have to rely on an efficient 
oxygen supply, which is also indicated by the great 
Bohr effect of their haemocyanin [28-30]. Neverthe- 
less, S. officinalis displays no difference in relative 
expression of the haemocyanin isoforms in response 
to changing temperature, hypercapnia or hypoxia [22, 
Thonig et al, unpublished]. 

Besides adaptation to changing environmental condi- 
tions, different isoforms of respiratory proteins are also 



known to be expressed during the course of ontogeny 
(Crustaceans [31], Cancer magister and C. productus 
[32-35], Artemia salina [36], Schistocerca americana 
[37], Chironomus plumosus and Chironomus thummi 
[38,39], chicken [40] and mammals [41]). For example, 
humans start with the expression of embryonic haemo- 
globin, which is replaced by foetal, and afterwards by 
adult haemoglobin (e.g., [42]). Likewise, the maternal 
haemocyanin in the oocytes of the crustacean Cancer 
magister is subsequently replaced by embryonic, juvenile 
and adult haemocyanin [43]. 

Thus, the evolution of embryonic versions of the 
respective respiratory proteins (haemoglobin and haemo- 
cyanin) occurred independently in invertebrates and verte- 
brates. This might indicate common challenges referring 
to oxygen supply during early development. Therefore, an 
ontogenetic relevance of the different haemocyanin iso- 
forms could be the case for Sepia officinalis. This is 
assumable especially as cuttlefish embryos have to face 
hypoxic and hypercapnic conditions towards the end of 
embryogenesis since their egg capsule functions as a diffu- 
sion barrier [44-46]. Actually, the existence of an embry- 
onic haemocyanin in cuttlefish had already been proposed 
on the basis of electrophoretic distinct protein fractions of 
haemocyanin in embryos, juveniles and adults of Sepia 
officinalis [47,48] . Furthermore, Beuerlein et al [49] iden- 
tified haemocyanin mRNA [GenBank:AF401231] not only 
in the branchial glands of cuttlefish embryos, which would 
reflect the situation in adults, but also in haemocytes, the 
branchial hearts, the midgut gland and the renal append- 
ages. This indicates both a switch of function in the 
haemocyanin metabolism of these tissues accompanied 
with hatching and a change of the site of synthesis of 
haemocyanin during ontogeny [50]. This would be well 
in line with a shift of haemocyanin expression. Finally, 
Strobel et al [22] detected the highest relative expression 
of the recently found potential isoform 3 in newly hatched 
cuttlefish, while it decreased significantly with time after 
hatching. This could point out an embryonic function of 
haemocyanin isoform 3. 

We therefore identified the complete sequence of 
haemocyanin isoform 3, and revealed its origin and evolu- 
tionary history. On the other hand, we tried to elucidate 
the role of this isoform 3 during embryogenesis by analys- 
ing the relative expression of the haemocyanin isoforms in 
different embryonic stages of the cuttlefish Sepia officina- 
lis; in this way, we combined an evolutionary and physio- 
logical approach to characterise the new differentially 
expressed haemocyanin isoform 3 of S. officinalis. 

Methods 

Animal collection and sampling 

Clusters of cuttlefish eggs were collected in May 2011 in 
Chioggia (Adria, Italy) and transferred to and kept at the 
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Alfred Wegener Institute in Bremerhaven, Germany. For 
further investigations, the egg capsules were removed, 
and the length and width of the embryos as well as 
their developmental stages according to Lemaire [51] 
were determined. We distinguished five different 
stages: El (Lemaire 21-24), E2 (Lemaire 25-26), E3 
(Lemaire 26-27), E4 (Lemaire 28-29) and H (Lemaire 30). 
Hereafter, either the complete embryos, the gills or the 
outer yolk sac were immediately frozen in liquid nitrogen 
for subsequent RNA analysis. Additionally, adult speci- 
mens collected in the English Channel (Caen) in 2010 and 
raised at the Alfred Wegener Institute at 15°C were 
used to compare the expression of haemocyanin mRNA 
(Thonig et al, unpublished). 

cDNA synthesis 

Due to the low weight of the individual embryonic and 
tissue samples, RNA was mainly extracted from pools of 
different individuals of the same developmental stages 
using the RNeasy Mini Kit (Qiagen GmbH, Hilden, 
Germany). The pools ranged from one to eight indi- 
viduals corresponding to 9 to 102 mg input material 
(Table 1). The samples were homogenised for 15 s at 
6,500 rpm in a Precellys beadmill (Bertin Technologies, 
Peqlab Biotechnologies GmbH, Erlangen, Germany) 
with doubled amount of recommended lysis buffer 
RLT (Qiagen GmbH, Hilden, Germany) at room tempe- 
rature. Further extraction procedures followed the RNeasy 
Mini Kit protocol. A DNase treatment of each extract was 
carried out according to the Ambion Turbo DNA-free™ 



Table 1 Tissue and embryo pools of 5. officinalis used for 
cDNA synthesis and subsequent real-time qPCR 



Name 


Stage 
(Lemaire) 


Tissue 


No. of 
individuals 


Input 
material [mg] 


Comment 


E1_Y 


21-23 


Yolk 


2 


102 


No statistics 


E1_E1 


21-23 


Embryo 


8 


49 




E1_E2 


21-23 


Embryo 


2 


13 




E2_E1 


24-26 


Embryo 


3 


45 




E2_E2 


24-26 


Embryo 


2 


25 




E2_E3 


24-26 


Embryo 


2 


28 




E3_E1 


26-27 


Embryo 


2 


43 




E3_E2 


26-27 


Embryo 


2 


45 




E3_E3 


26-27 


Embryo 


1 


29 




E3_E4 


26-27 


Embryo 


1 


34 




E3_E5 


26-27 


Embryo 


2 


48 




E3_E6 


26-27 


Embryo 


1 


31 




E4_E1 


28-29 


Embryo 


2 


19 




E4_E2 


28-29 


Embryo 


1 


47 




E4_E3 


28-29 


Embryo 


1 


32 




H_G 


30 


Gill 


4 


12 


No statistics 



Kit (Ambion, Applied Biosystems, Darmstadt, Germany). 
Afterwards, cDNA synthesis was performed via RT-PCR 
with 0.4 ug of RNA either according to the High Capacity 
cDNA Reverse Transcription Kit (Applied Biosystems, 
Darmstadt, Germany) for Real-Time PCR or according to 
the Invitrogen Superscript III cDNA synthesis kit (Life 
Technologies) for subsequent sequencing of the haemo- 
cyanin isoform 3 cDNA. 

qPCR and statistical analysis 

The relative expression of the different haemocyanin 
isoforms was analysed using an ABI 7500 Real-Time 
PCR System with the respective 7500 SDS Software, ver- 
sion 1.3 (Applied Biosystems, Darmstadt, Germany). The 
20 ul PCR reaction was performed in MicroAmp® Optical 
96 well plates (Applied Biosystems, Darmstadt, Germany) 
using the SYBR @ Green PCR Master Mix (Applied Biosys- 
tems, Germany). Each qPCR replicate contained 2 ng of 
cDNA and isoform-specific primers with a final con- 
centration of 300 nM. The primers were designed and 
optimised by Strobel et al [22] on the basis of the respect- 
ive sequences available in GenBank: 

Isoform 1 [GenBank:DQ388569]: 

SofHclJwd CTTTTCGAGTTTACCAGCTCTTGTT, 

SofHcl_rev CCTGCAACGTCAATATATGAGTGAT. 

Isoform 2 [GenBank:DQ388570]: 

SofHc2_fwd TCTCCCGTTTTTGGTAACTGAAC, 

SofHc2_rev TGTC AG C AACATC AATATAACC ATG A. 

Isoform 3 [GenBank:JN392726]: 

SofHc3_fwd TGCTCCGTCAACCAATGTCC, 

SofHc3_rev TCCAGGGCATCTCGTTTTCAC. 

The PCR programme was composed of a 10 min 
activation step at 95°C and 40 amplification cycles (15 s 
at 95°C for denaturing, 1 min at 60°C for annealing and 
elongation) with a single fluorescence measurement dur- 
ing the annealing step. A melting curve analysis was per- 
formed to ensure the amplification of one single product. 
Efficiencies for the different primer pairs were calculated 
according to the equation E = io (_1/slope) , using the decadic 
logarithm of cDNA input plotted against the cycle thresh- 
olds (C T ) for five template dilution steps (efficiency SoHl: 
1.843, SoH2: 1, 1.859, SoH3: 1.824 ). 

For further analysis, the C T -values were trans- 
formed into relative quantities [52] using MS Excel 
according to the following equation: quantity (x) = ef- 
ficiency (C t < Minimum )- c T (*». Furthermore, the relative 
quantities were normalised to adjust differences in tem- 
plate input using the normalisation coefficients calculated 
by geNorm [53] . For this reason, the potential housekeep- 
ing genes cytochrome c oxidase (COX) and nucleolin were 
analysed via Real-Time PCR as well. The respective primer 
pairs were designed on the basis of 454 pyrosequencing 
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results of cDNA of the branchial gland of adult Sepia offi- 
cinalis (Lieb & Mark, unpublished) using Primer Express 
Software for Real-Time PCR, version 3.0 (Applied Biosys- 
terns, Darmstadt, Germany) (nucleolin: Nuc_fw CAAGCC 
G C A A AG A A AG AC A AG , Nuc_rv GCTTTAGATTCCT 
TTTTGGAAGCA, COX: COX_fw CAATAGGAGCCGT 
TTTTGCG, COX_rv AGGGTAATCAGAGTATCGTCG 
TGG) and yielded efficiencies of 1.719 for COX and 1.809 
for nucleolin. The relative quantities of the respective C T 
values of the potential housekeeping genes were then used 
to calculate the normalisation coefficients for the expres- 
sion of the haemocyanin isoforms with geNorm. 

Finally, Nalimovs test [54] on a significance level of 
95% was performed with the normalised relative quan- 
tities of the haemocyanin isoforms to remove outliers. 
The data were illustrated as the mean with standard 
error using GraphPad Prism 5.0b (GraphPad Software, 
Inc., La Jolla, CA, USA) and the expression trend using 
Adobe Illustrator (Adobe Systems Software Ireland Ltd., 
Ireland). Statistical analysis to determine significant differ- 
ences in expression of the three haemocyanin isoforms 
during embryonic development was performed using the 
non-parametric Mann- Whitney U-Test via SPSS Statistics 
19 (IBM, Ehningen, Germany). 

Sequencing 

The SoH3 cDNA sequence was obtained via primer 
walking using cDNA derived from yolk and embryonic 
samples of the earliest developmental stage as template 
to ensure a large amount of SoH3 mRNA. Touchdown 
PCRs were performed with one degenerated primer 
coding for the conserved copper-binding sites [10] and 
one SoH3-specific primer designed with MacVector 12.0 
(MacVector, Inc., Cambridge, UK) based on the already 
known fragment of SoH3. The touchdown PCR pro- 
tocols consisted of a 4 min activation step at 94°C, 12 
touchdown cycles with the annealing temperature de- 
creasing by 1°C per cycle and finally 35 amplification 
cycles. The annealing temperature and elongation time 
of the PCR protocols were chosen with respect to the 
melting temperature of the primer pair and the length of 
the expected PCR product. The amplicons were sepa- 
rated in a 1.3% agarose gel with GelRed and extracted 
using the QIAQuick Gel Extraction Kit (Qiagen GmbH, 
Hilden, Germany). Afterwards, the extracts were either 
used for an additional amplification and purification step 
to increase product concentration or sequenced directly 
by Eurofins MWG GmbH (Martinsried, Germany). 

Additionally, PCR products were obtained using two 
degenerated primers [10]. Those ones and PCR products 
giving double peaks after the sequencing reactions were 
cloned into the pGemT-easy vector (Promega, Madison, 
WI, USA) or TOPO TA vector (Life Technologies Invi- 
trogen, Darmstadt, Germany). A PCR with the standard 



vector primer M13 uni and M13 rev was followed by an 
RFLP analysis using several restriction enzymes (Muni, 
Hinfl, Alul, Spnl, Mbol, Rsal) to determine different 
inserts. These inserts were then purified using the 
QIAQuick Gel Extraction Kit and sequenced by Eurofins 
MWG GmbH (Martinsried, Germany) or GATC Biotech 
AG (Konstanz, Germany). 

The 5 '-end and 3'- end of the SoH3 mRNA were 
amplified with the Ambion FirstChoice @ RLM-RACE Kit 
and the Invitrogen GeneRacer Kit (Life Technologies Invi- 
trogen, Darmstadt, Germany) according to the manual 
and sequenced by Eurofins MWG GmbH (Martinsried, 
Germany) or GATC Biotech AG (Konstanz, Germany). 

Additionally, genomic DNA was isolated from cuttle- 
fish branchial glands using Qiagen DNeasy columns 
according to the manufacturer s protocol. After shearing 
of 1 ug DNA (Covaris S2), an Illumina paired-end library 
was constructed according to TruSeq DNA Sample 
Preparation v2 protocol. First, the DNA was end re- 
paired, 3 '-ends were adenylated and adapters were liga- 
ted. Subsequently the DNA was amplified (10 cycles) 
and double size selected using the SPRIselect reagent kit 
(Beckman Coulter). The resulting DNA fragments ranged 
between 300 and 600 bp with a mean size of ca. 280 bp. 
Sequencing was done on 1.375 lanes of an Illumina HiSeq 
2000 paired-end flowcell resulting in approximately 
400,000,000 reads. DNA processing and sequencing were 
done by Genterprise (Mainz). 

Sequence analysis 

The single sequences were trimmed, corrected on the 
basis of the chromatogram and assembled to a complete 
SoH3 sequence using Geneious 6.1.2 (Biomatters). Over- 
laps of at least -150 bp were used to assure a correct 
assembly. The Illumina reads were trimmed and mapped 
on the cDNA sequence allowing maximum 10% mis- 
matches per read. From this mapping the position of 
introns and the FUs could be deduced according to the 
GT/AG rule. The 5'- and 3'-UTRs were annotated 
according to the derived open reading frame (ORF). The 
respective protein sequence was obtained via translation 
using the standard genetic code. Furthermore, a signal 
peptide was detected via SignalP v4.1 and potential 
N-glycosylation sites using NetNGlycl.O. Finally, the 
molecular weight as well as the theoretical isoelectric 
point of haemocyanin subunit 3 was determined via 
ExPASy Bioinformatics Resource Portal [55]. 

For phylogenetic analyses the haemocyanin sequences 
of Sepia officinalis (SoHl [GenBank:DQ388569], SoH2 
[GenBank:DQ388570]), Enteroctopus dofleini (A-type 
[GenBank:AY751301], G-type [GenBank:AF338426]), 
Nautilus pompilius [GenBank:AJ619741] and Haliotis 
tuberculata (HtHl [GenBank:AJ252741], HtH2 [GenBank: 
AJ297475]) as outgroup were added from GenBank. For 
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those sequences the FUs were annotated according to the 
genomic data of S. officinalis [6,10]. The multiple se- 
quence alignments of the complete haemocyanin subunits 
and the FUd/d' were obtained using MUSCLE imple- 
mented in Geneious 6.1.2. The alignments were curated 
using Gblocks 0.91b ([56]; parameters used for haemocya- 
nin/FUd alignment: minimum number of sequences for a 
conserved position: 5, minimum number of sequences for 
a flanking position: 6/7, maximum number of contiguous 
nonconserved positions: 8, minimum length of a block: 
10; allowed gap positions: none, use similarity matrices: 
yes) to eliminate poorly aligned positions. Subsequently, 
we determined the appropriate substitution model via 
ProtTest 3.2 [57] according to the corrected Akaike 
information criterion (AICc): WAG + I + G + F (I = 2.199, 
G = 0.199 for complete He sequences), WAG + G + F 
(G = 0.799 for FUd/d'). Based on these substitution 
models, Bayesian trees were calculated via MrBayes 
3.2.1 [58] running an MCMC analysis with 2,000,000 
generations and a sampling frequency of 200 genera- 
tions. According to the respective trace files, the like- 
lihoods had reached stationarity before 20% of the 
MCMC chain, which was therefore used as the burnin 
fraction. Additionally, maximum likelihood trees were 
calculated using the PhyML plugin of Geneious 6.1.2 [59] 
performing 1,000 replicates to calculate bootstrap values. 
Since the PhyML plugin did not allow to account for the 
amino acid frequencies (+F), the following substitution 
models were chosen: LG + I + G (I = 1.316, G = 0.144; 
for complete He sequences), WAG + 1 + G (I = 0.193, 
G = 1.696 for FUd/d'). 

A relaxed molecular clock was calculated with BEAST 
vl.5.4 using a Bayesian MCMC analysis [60]. The follow- 
ing settings were chosen: WAG + 1 + G as substitution 
model, an uncorrelated log-normal distribution as clock 
mode and birth-death process as tree prior. Two ana- 
lyses were run for 10,000,000 generations and were 
logged every 1,000 generation. Stationarity and appropri- 
ate mixing of each was checked using Tracer vl.5 [60]. 
The runs were combined using LogCombiner and the 
resulting file was processed with TreeAnnotator using a 
burnin of 20%. An additional run only under the priors 
was performed to confirm that the chosen priors were 
sufficiently non informative. The split of Gastropoda- 
Cephalopoda in the late Cambrian 550 ± 50 Mya was 
used as calibration point [6,7,17]. 

Ethics statement 

All sampling of cuttlefish was conducted according to 
the ethics and guidelines of the German law and do 
not require a formal permit. The experiments have 
been communicated according to § 8 animal welfare 
act (18.05.2006; 8081. I p. 1207) to the veterinary inspec- 
tion office 'Senatorin fur Arbeit, Frauen, Gesundheit, 



Jugend und Soziales, Abt. Veterinarwesen, Lebensmit- 
telsicherheit und Pflanzenschutz', Bahnhofsplatz 29, 
28195 Bremen, Germany, evaluated and approved of on 
July 12, 2011. 

Results 

Haemocyanin mRNA expression 

To investigate a potential ontogenetic relevance of the 
existence of different haemocyanin isoforms in cuttlefish, 
we measured their expression in different developmental 
stages of embryogenesis via Real-Time PCR. Thereby, 
we detected distinct differences during ontogeny refer- 
ring to both the relative expression (Figure 1) and the 
ratio of the isoforms (Table 2). As shown in Figure 1, the 
mRNA of isoform 3 is highly expressed in embryos and 
also in the yolk sac of the earliest developmental stage. 
While SoH3 remains almost constantly at a high level of 
expression during embryonic development, significant 
amounts of SoH2 and SoHl appear only during late 
organogenesis in stage E3 and E4, respectively. Though 
SoH2 could be detected in very low amounts already at 
stage El (even in the yolk sac) and E2, it only increased 
significantly during the following stages E3 and E4. 
Finally, SoHl was first present at stage E3 with signifi- 
cantly increased expression shortly before hatching (E5). 
Thereby SoH3 represents about 99.8%- 100% of the 
haemocyanin expressed in early organogenesis (El + E2) 
measured in the complete embryo (Table 2). Before 
hatching, SoH3 accounts for 66.6% of haemocyanin, 
while SoH2 represents 31.7% and SoHl 1.7%. In gills of 
hatchlings (H), ratios of the haemocyanin isoforms were 
measured as follows: SoH2 90.1%, SoHl 9.7% and SoH3 
0.3%. This reflects the situation found in the branchial 
gland of adult cuttlefish (incubation at 15°C): SoH2 
87.2%, SoHl 12.8% and SoH3 0.03%. 

SoH3 sequence 

The SoH3 mRNA sequence (GenBank:KF306341) 
consists in total of 10,277 bp. It is composed of a 39 bp 
5'-UTR, a 10,017 bp ORF containing the eight FUs and 
a 221 bp 3-UTR. The typical AATAAA polyadenylation 
signal can be found 13 bp upstream of the poly- (A) tail. 
Partly different versions of the haemocyanin sequence 
were obtained from sequencing and could not definitely 
be assigned referring to either the chromatogram or 
genomic data, or via cloning. Therefore, 20 ambiguous 
sites of 10,277 bp in total (representing 0.2%) remain in 
the sequence. They are mainly spread in the FUa (5), 
FUb (8) and FUc (7). 

The deduced primary structure consists of 3,338 
amino acids (aa), including a signal peptide of 26 aa and 
the actual polypeptide of 3,312 aa. This consists of 8 FUs 
ranging from 409 to 418 aa, reflecting the FUs found in 
the other known isoforms. Thus, the duplication of FUd/d' 
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Developmental stage 

Figure 1 Haemocyanin expression during ontogeny. The mRNA expression of the different haemocyanin isoforms (SoHl, SoH2, SoH3) of 

Sepia officinalis in different tissues (yolk, embryo, gill, branchial gland) and developmental stages ranging from embryos (E1-E4) to hatchlings (H) 

and adults (A). The histogram shows the mRNA expression as normalised relative quantities with standard error. Thereby, asterisks indicate when 

the expression of the respective isoform differs significantly compared to the previous developmental stage on a significance level of 0.05 

according to the Mann-Whitney-U test. Due to the low number of samples for E1 yolk and J gill, those stages are not taken into account for 

statistics. The lines illustrate the shift of the haemocyanin isoforms. 
v J 



is also present in SoH3. The translated sequence still 
contains 0.2% ambiguities (6 aa of 3,338 aa in total: FUa: 2, 
FUb: 2, FUc: 2), which all resemble isofunctional amino 
acid exchanges. The linker introns found between the 
FUs are all in phase 1. The internal introns in the signal 
peptide (phase 0), FUb (phase 2), FUe (phase 0) and FUf 
(phase 0) reflect the situation found in E. dofleini refer- 
ring to location and phase [10], whereas the one found 
in FUg (phase 1) represents the internal intron also 
present in N. pompilius [6] (Figure 2). The haemocyanin 
protein SoH3 has a calculated molecular mass of 
377,793 Da with a theoretical isoelectric point (pi) of 
5.94. Each FU shows the characteristics for molluscan 
haemocyanin also found in Enteroctopus and Nautilus [6] 



Table 2 Percentage of the different isoforms adding to 
haemocyanin mRNA 



Sample 


SoHl 


SoH2 


SoH3 


E1_Y 


0 


0.001 


99.999 


E1_E 


0 


0.016 


99.984 


E2_E 


0 


0.249 


99.751 


E3_E 


0.031 


3.499 


96.469 


E4_E 


1.657 


31.73 


66.613 


H_G 


9.674 


90.062 


0.264 


A_BG 


12.799 


87.174 


0.027 



The percentage is given for the respective tissues (Y: yolk; E: embryo; G: gill; 
BG: branchial gland) of different ontogenetic stages ranging from embryos 
(E1-E4) to hatchlings (H) and adults (A). 



(Figure 3). These are six histidine residues for copper 
binding (histidine residues in total: 173) as well as 
one thioether bridge (Cys73-His76) and three poten- 
tial disulphide bridges (Cys62-Cys73, Cysl87-Cys254, 
Cys344-Cys356), whereas the last one is not present 
in FUb and FUc. Ten potential glycosylation sites can 
be detected, nine NXT motifs and one NXS motif 
(Figure 3), whereas in FUc, FUd and FUe the central 
proline should block the glycosylation site [61]. The 
observed glycosylation pattern of SoH3 reflects the one 
already described for cuttlefish SoHl [62] and cepha- 
lopods in general [6]: no glycosylation of FUc, one con- 
served glycosylation site (aa 415) in FUa, FUb, FUd, FUd' 
and FUe, one in FUg (aa 66) and another one present in 
FUd and FUd' (aa 268), which is probably not glyco- 
sylated in FUd' [62]. Differences concerning the gly- 
cosylation appear in FUe and FUf, in which SoH3 
exposes more potential sites for glycosylation (FUe: aa 
14 and aa 125, FUf: aa 163 and aa 402). The amino 
acid composition between the isoforms does not differ 
considerably. 

Phylogeny of cephalopod haemocyanin 

The amino acid sequences share identities of 81.7% for 
SoHl-SoH2, 78.5% for SoHl-SoH3 and 77% for SoH2- 
SoH3. The topology of the maximum likelihood and the 
Bayesian trees are alike, therefore both results are 
displayed in one tree given both the support and boot- 
strap values, respectively. The phylogeny of cephalopod 
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Figure 2 Molecular clock and exon-intron structure of cephalopod haemocyanin. Relaxed molecular clock with a mean evolutionary rate of 
7.636 x 10~ 4 calculated with BEAST v1 .5.4 based on a maximum likelihood tree for the amino acid sequences of the complete haemocyanin 
molecules of Sepia officinalis (SoH1-SoH3), Enteroctopus dofleini (OdHA, OdHG), Nautilus pompilius (NpH) and Haliotis tuberculata (HtHI, HtH2). The 
split of gastropoda and cephalopoda about 550 ±50 Mya was used as calibration point. Included are the exon-intron structures of the haemocyanin 
of N. pompilius, E. dofleini and 5. officinalis. Grey bars indicate the 95% HPD (highest posterior density), i.e. the Bayesian confidence interval of the 
estimated age. 



haemocyanin with the two isoforms of the gastropod 
Haliotis tuberculata as an outgroup is displayed in 
Figure 4. 

Likewise, the phylogeny of FUd and FUd' is shown in 
Figure 5. 

The relaxed molecular clock with a mean evolutionary 
rate of 7.636 x 10" 4 and a standard deviation of 0.237 re- 
vealed divergence times for the split of Tetrabran- 
chiata-Dibranchiata of about 500 Mya and the split of 
Octobrachia-Decabrachia of about 267 Mya (Figure 2). 
Accordingly, the first duplication of cuttlefish haemocya- 
nin occurred about 163 Mya and the second one about 
100 Mya. 

Discussion 

Haemocyanin during embryogenesis 

SoH3 is the most prominent haemocyanin isoform 
expressed in all analysed embryonic stages, whereas 
SoHl and SoH2 expression starts during organogenesis 
(Lemaire 18-30). After hatching, the expression of SoH3 
decreases drastically in the branchial gland (see also 
[22]). Hence, we could observe a shift of the haemo- 
cyanin isoforms during ontogeny. Accordingly, SoH3 
represents an embryonic haemocyanin, whereas SoHl 
and SoH2 are dominant in adult specimens. This is 
supported by the results of Decleir et al (1970, 1971) 
[47,48], who detected electrophoretically distinct protein 
fractions of cuttlefish haemocyanin during ontogeny. 
However, Ruth et al (1999) [63] could not detect hae- 
mocyanin in sections of the midgut gland and branchial 
gland of Sepia embryos younger than NAEF stage XVII 
(representing E2 of this study) by immunohistochemistry. 
In contrast, Wolf et al [64] found evidence for 



haemocyanin in extracts of eggs and embryos (larger than 
0.5 mm in mantle length) using Immunoelectrophoresis. 
In vertebrates, a developmental shift of the respiratory 
protein is well studied [40,41] and also for invertebrates it 
has already been described [31,37,38]. Yet Sepia officinalis 
represents the only mollusc for which an embryonic form 
of haemocyanin has been published. Only a truncated 
haemocyanin form has so far been found in egg masses of 
Biomphalaria glabrata [65], a freshwater snail exhibiting 
haemoglobin as their respiratory protein. Additionally, we 
have some evidence for an embryonic haemocyanin iso- 
form in the keyhole limpet Megathura crenulata accord- 
ing to protein analysis (Lieb et al, unpublished). 

SoH3 mRNA was also detected in yolk sac samples of 
S. officinalis, which at least in some way compares to 
humans, where the formation of blood cells also starts 
in the yolk sac [66]. Yet, this raises the question about 
the site of synthesis of haemocyanin during early em- 
bryogenesis. In Cancer magister the haemocyanin found 
in oocytes is of maternal origin and is subsequently 
replaced by an embryonic one [33]. In the gastropod 
Haliotis asinina no maternal haemocyanin mRNA could 
be detected, but the expression of haemocyanin started 
9 h after fertilisation [67]. Considering that SoH3 mRNA 
is present in the embryo until hatching, it might rather 
represent an embryonic than a maternal haemocyanin, 
as maternal mRNA also requires a maternal translation 
machinery and is usually essential during early embryo- 
genesis, but starts to be degraded partly from egg activa- 
tion on and partly from the onset of zygotic transcription 
[68]. Decleir et al [48] proposed the production of 
embryonic haemocyanin of cuttlefish in the perivitelline 
membrane of the yolk sac. In different cephalopods this 
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S.0.Hc1_FUa KLIRKNVDTLSH DEI LNLQIALRNMQ DDEGATGYQAI SAYHGEPADCKAADG-SQIVCC LHGM PTF PMWHRLYMVQFEQAVAGHGS TLGI PYWDWTK PMSH LPELVQH PLFI DPNGQKAKKNVFYSGEIKF — ENRVTARAV 

S.0.Hc2_FUa KLIRKNVDTLSH DEI LN Ifl I A I^NMQDDEGATGYQAISAYHGE PA DCKAA 1X3- SQ I VCC LHGM PTF PMWHRLYMVQFEQAVAGHGS TLGI PYWDWTK PMSH LPELVQH PLFI DPNGQKAKKNVFYSGEIKF- -ENRVTARAV 

S.0.Hc3_FUa KFVRKNVDQLTS DEI LELQVALRSMQDDDGPTGYQAISAYHGEPADCKAADG-S TIVCC LHGM PSFPMWHRLYIVQFEQAMASHGSKLGI PYWDWTKPMTH LPELVQH PLFI DPSGKKAKKNVFYSGEIKF — EHKVTARAV 

S.0.Hc1_FUb EMKWDV — TKLKTRKDVSS LTDEEKY DLRVAMERFMS DKS INGYQALAEFHGLPAKC PRPDALNRVACCVHGMATF PHWHRLVWQFEDALINRGS PIGV PYWDWTK PMKALPDLLADETYVDPYTKETKPN PFFSAPI EFMHAGVS TKRVI 

S.0.Hc2_FUb NMVNMDT — SH LQFRK DVNTLTTEEQYELRVAMERFMS DKS INGYQALAEFHGLPAKC PRPDALNRVACCIHGMATF PHWHRLWMQFEDALVARGS PIGVPYWDWTK PFTALPNLLAEETYVN PYTQESK PNPFYRASI DFLKADVH TSRQI 

S.0.Hc3_FUb GMRDVDE — TG LKTRK DINRLTTEEEYKLRLAMERFMADKS ISGYQALAEFHGLPAKC PRPDALNRVACCAHGMATF PHWHRLVWQFEDALTARGS PIGVPYWDWTK PLDS I PHLLADENYVDPY TKETK PN PFYRAAI EFLKADVF TSRKI 

S.0.Hc1_FUc EDVHFD EQHRSRKNVDSMTKS EMEDLRKAMTAFAADKGVTGYQQVAAFHGS TKWC PS PDAAQKYACCHHGMATF PHWHRLLTLNFENGLRRNGYSGGI PYWDWTRPI EALPALVLEEQY TDSNG- ESH PN PFFSGAI DE--AGAVTSRAP 

S.0.Hc2_FUC DGDVHFD EQHRSRKNVDSMTHS EMDDLRKAMTAFAADKGVTGYQQVAAFHGS TKWC PS PDAAQKYACCHHGMATF PHWHRLLTLNFENGLRRNGYSGGI PYWDWTRPI EALPALVLEEQY TDSNG- ESH PN PFFSGAI DE--AGAVTSRAP 

S.0.Hc3_FL)C ESVKFD ETHRSRK DVASMTKHEMNNLRVALAAFAADKTI TGYQQVAAFHGS TQWC PS PDAAVKYACCQHGMATF PHWHRLLTVNFENGLRRNGYHGGV PYWDWTR PIHALPTLWEEQY TDSKG- EAH PN PFFSGEI DE — AKAKTSRAP 

S.0.Hd_FUd KYGEGRAWI EPVTSAVRIRKNLN DLSGGELI- LRNYIKQMTKDGS YQQIAAFHGLPAQC PS EDGTTVH TCC LHGM PTF PHWHRLYVALVEDELLSRGSRSGRPYWDWI DPFDRLPDFFN DATYYNSRTLHI ESN PFFRGSMSF — AN TLTDRDA 

S.0.Hc2_FUd HFGEGRRWI S PVTSASHIRKNLDDLS DGEMES LRNAFKQMAN EGR YEQIAAFHGVPAQC PNEDGTMVH TCC LHGMPIF PHWHRLYVALVEDELLARGIRSGRPYWDWVEPF DELPRLINEATFYNSRTLKI EPN PFFHGKI SF-- ENMETDRDP 

S.0.Hc3_FUd KYGQGKKWMDPISSASRIRKNLKDLNEGEI ES LKATFQQMKKDGK YEQIAAFHGLPAQC PNEDG TKVY TCC LHGM PTF PHWHRLYVALVEDELLARGSGVAV PYWDWI EPF DTLPKLI S EPS YYNSRS LHI ERN PFFKGTI SF-- ENAI TDR DP 

S.0.Hc1_FUd' SGGK DAWGGVI TSANRIRRNLK DLNKDQMICFAEAFRSMAESGR YEEIAAFHGLPAQC PDESG DKVF TCC LHGMPVF PHWHRLY LA LVENELMSRGS CI AV PYWDWI EPF DVLPVLIN DLTYYN PQTNKVH PN PF LKGNISF — ASS ETHRS P 

S.0.Hc2_FUd' EFEEDSWRDWTSANRIRRNLK DF TEEDMLS LRTAFKRMTDDGR YEEIAAFHGLPAQC PNADG TMVH TCC LHGM PTF PHWHRLY LS LVENELLARGS DVAV PYWDWI EPF DH LPRLIN DKSYRH PK TH EEI SN PFHHGTISF-- DNS I TVRKL 

S.0.Hc3_FUd' AAEEKSWNGI I TTADRIRK DLKALSKDEMTCIAEAFKRMN DDG T YEKIAAFHGLPAQC PTEDGSMVY TCC LHGM PTF PHWHRLYVALVENELLARGSCNAV PYWDWI EPF DKLPQLVS DKTFYN PNTKKI EPN PF LKGSI SF — EN TETHRN P 

S.0.Hd_FUe H TVQSATHSN L — YGN LVRKNVDRLS LQEVNS LVHALTRMQKDKSADGFETIASFHALPPLC PN PTAKHRYACC LHGMATF PQWHRLHVVQFEQALHRHGA TVGVPYWDWTY PMKEVPHLLTS EKY TDPF TGVETFN PFNHGH LSFI S PETMTTRDV 

S.0.Hc2_FUe RTMRTSKSKSV — RGN LVRKNVDRLS LQEVNS LVHAIjKRMQKDRSS DGFESIASFHALPPLC PN PTAKHRYACC LHGMATF PQWHRLYWQFEQS LNRHGA TVGVPYWDWTY PMKEVPHLLTS EKY TDPF TGVETFN PFNHGH LSFI S PETMTGRDV 

S.0.Hc3_FUe RVIHGATYNYSI LVRKNADRLS LKEVNS LVHALKRMQKDISC DGFEAIASFHALPPLC PN PTAKYRFACCVHGMATF PQWHRLYWQFEQALHRHGAIVGV PYWDWTY PMKQI PS LFSS ENYTDPFTGVETYN PFNHGH I SFIS PETMTARDV 

S.0.Hd_FUf EH I EVAI PLNKIRRNIQ TLDERDIQNLMSS LRRLKEDES DFGFQ TIASYHG-S LLC PTPEA- PEYACC LHGMATF PHWHRVY LLHFEDAMRRHGANVAVPYWDWTLPISGLPS LLADADYY DVWS DSVI EN PF LRGFIKQ — EDTF TVRDI 

S.0.Hc2_FUf H DLEVAI PLNRIRRDINS LETRDVQNLMSALKRLKDDES DFGFQ TIASYHG-S LLC PTPDA- PQYACC LHGM PTF PHWHRVY LLHFEEAMRRHGASVAI PYWDWSMPSSH LPALLG DAAYY DAW T DSVI EN PF LRGYIKQ — EDTY TVRDI 

S.0.Hc3_FUf EHVEVER PLNKIRRNI ETI EERDIQNLMSALRQLKS DES TFGFQKIASYHG-SKMC PTPEA- PEYACC LHGM PTF PHWHRVY LLHFEEAMRRHGANVAV PYWDWTLPIS ELPS LLSKADYY DACADTVNEN PF LRGYIKY — EDTY TVR DI 

S.0.Hc1_FUg HLI EKEKPHSAGAAVRKNVNS LTPYETNNLRMALAAVQADI TDTGFQKIASFHGMPLSCKY DNGSA-YACCQHGMVTF PHWHRLYMKQMEDALRARGAKVAI PYWDWTTAFSS LPALVTDE KNNPFHHSYI DV--AGTI TTRS P 

S.0.Hc2_FUg MVSQHIEDHDTETLIRKNVNSLSPSEIKNLRDALVAVQADKSGNGYQKIASYHGMPLSCHYPNGTA-FACCQHGMVTFPHWHRLYMKQMEDAMKAKGAKIGIPYWDWTTTFSHLPFLVTEP KNN PFHHGYI DV — ADTKTTRN P 

S.0.Hc3_FUg REVHEEKIHKVGNLIRKNVDS LTDAEI DN LRDALAAVQADKTENGYQKIAAYHGMPLSCHY PNGTA-YACCQHGMVTF PHWHRLFVKQMEDALRTRGARVGI PYWDWTTAFSS LPALVTDT KNN PFHHAHI DV — AN TDTTREP 



E.d.HcG_FUg 



S.0.Hd_FUa DARLY EASKEGHKNF LLEGVLNALEHEDYCHFEVQ LEVAHN PIHY LVGGRF THSMSS LEY TS Y DPLFFLHHSNVERQFALWQALQKHRG LS T-RPNCGLNLFHS PMEPFGRDS — NPFAI TKDNAKPSSVFDYE-HLGYEYDDLTLNGMSIGELDTL 

S.0.Hc2_FUa DARLYEASKEGHKNFLLEGVLNALEHEDYCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYTSYDPLFFLHHSNVERQFALWQALQKHRGLST-RPNCGLNLFHIPMEPFGRDS — N PFAI TKDNAK PSSVF DY E-H LGYEY DDLTLNGMTVEELETL 

S.0.Hc3_FUa DARLYDASKKGHENFLLEGVLNALEHEDFCRFEVQFEVAHNPIHYLVGGRITHSMSSLEYTSYDPIFFLHHSNVERQFALWQALQKHRGLST-KANCGLNLFHEPMEPFGRET — N PIALTK DNAK PSSVFEY D-Q LGYEY DDLTLNGMS LEELNNL 

S.o.Hc1_FUb DARLFKEPTVG dhs y LH dglmlafeqh dfc DFEVQFEVTHNAIHRWTGGS EPYSMSS lhy TSFH PMFWLHHS evdrlwsiwqs lqihrg lpy-kahcass evyq pmk PFAF DAPLNN DELTHSHSVPTDI y dyqs eleyaf dtlffggms irelqeh 

S.o.Hc2_FUb ddrlfkq pskg dhgf lf dglllafeq ddfc dfevqfevthnaihawtggs EPYSMSS lhy tsf dpmfrlhhsqvdrlwaiwqalqiqrgk py-ktycans evyrpmk pfafeaplnnnehtrehsvptdvy dyqadlhy ty dtlffggms irelqrh 

S.o.Hc3_FUb derlfkkBIvg dhgy lf dgmllafeqh dfc dfevqfevthnaihawvggnepysms tlhy tsf dpmfwlhhsqvdrlwaiwqalqiqrglpy-kahcass evhq plkpfafq pplnn delthshs I PTDI Y dyrtelky ty dtlffggms irelqhh 

S.o.Hc1_FUc s eh lyekpdfgky telanevi yalekedfc df dvqfeiahnhihalvggteafsmss leysaf dpifmlhhsnvdriwatwqalqkfrgk pyntancai ellrkpms pfs las dinadamtrehsvpfdvf dykkafhyefdtlelngms I PQ LSRE 

S.o.Hc2_FUc s eh lyekpdfgky telanevi yalekedfc df dvqfeiahnhihalvggteafsmss leysaf dpifmlhhsnvdriwatwqalqkfrgk pyntancai ellrkpms pfs las DINADAMTREHSVPFDVF DYKKAFHYEFDTLELNGMS I PQ LSRE 

S.0.Hc3_FUc SATLFEQPEFGHYTHLADEIFYALEQENFCDFEVQFEIAHNHIHALVGGTELYSMSSLEYSAFDPIFMLHHSNVDRIWATWQALQKFRGKSYNSANCAIEMLRQPMS PFSLSS DIN PDAQTS EHSVPF DVF DYKKSFHYEY DTLELNGLSI PQLSRE 

S.0.Hc1_FUd QDVIYNN HEFY DKALLVLEQ TDFCEFEVQ LEVLHNRIHSMLGGREI YSMSS LDYAAY DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLLNEPMR PFSNKTA-NH DRLTF TNSRPN DVF DYQNVLHYKY DTLKFEGLSI PELEAL 

S.0.Hc2_FUd QPELFGN S Y LY DH TLFVFEQ TDFC EFEVHFEVLHN TI HSWLGGRDAH SMS S LDYAAY DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLLNEPMR PFSNKTA-NQ DWLTF TNSRPN DVF DYQNVLHYKY DTLSFAGLS I PQ LERI 

S.0.Hc3_FUd QPDLFNN NY LH DHALFAFEQ TDFC EFEVHFEVLHN TIHSWLGGR DPH SMS S LDYAAY DPVFFLHHSNVDRLWAIWQELQRYRKLSFNEANCALPLMNQ PMR PFSNS TA-NH DRLTF THSRPS DVF DYQNI LHYKY DTLSF DGLSI PEVES L 

S.0.Hc1_FUd' TKELYGN RY LY DHALFALEQ TDFC EFEVHFEVLHN TIHSWIGG PSRHSMSS LDFAAY DPVFFLHHSNVDRLWAIWQELQRYRKLDYNTATCAKNYLNK PMR PFSNS TA-NH DRLTYVNSK PNDVF DYQNVLHYKY DSISFSGLNI PQLENV 

S.0.Hc2_FUd' NDKLFNN RY LYEHALYAFEH TDFC DFEVHFEVLHN TIHSWIGG PN PHSMSS LDYAAY DPVFFLHHSNVDRLWAIWQELQRYRKLDYNVANCALHLLS DPMR PFNNKSA-NH DHLTF THSRPN DVF DYQNS INYKFDTLSFSGLSI PQLDDL 

S.0.Hc3_FUd' RPQLFGN RY LY DHALYALEQ TDFC EF EVHFEVLHN TIHSWLGG PSVHSMSS LDYAAY DPVFFIHHSNVDRLWAIWQELQRYRN LDYN TAACAKNY LNKPMR PFSNETA-NRDRLTMMNSK PN DVF DYQNVLHYKY DTLSFSGLNI PQ LENV 

S.0.Hd_FUe S EH LFEQ PALGKQ TWLFNNI LLALEQ TDYC DFEVQFEIVHNAIHSWLGGKEI YS LNH LHYAAY DPAFY LHHSNVDRLWVIWQELQKFRGFSAH ESNCAI ELMSQ PLK PFSFG PPYNLN PMTTKYSK PS DVFNYKEHFHYEY DMLEMNGMS IAQ LESY 

S.0.Hc2_FUe S EH LFEQ PALGKQ TWLFNNI LLALEQ TDYC DFEVQFEIVHNAIHTWLGGKEIYSLNHLHYAAYDPAFYLHHSNVDRLWVIWQELQKFRGFS AY ESNCAI ELMSQ PLK PFS FGAPYNLN PMTTKYSK PS DVFSYKEHFHYEY DMLEMNGMS IAQ LESY 

S.0.Hc3_FUe NSHLFEQ PALGKQ TWLFDNI I LALEQ TDYC DFEIQFEIVHNAIHSWLGGKELYSLNHLHYAAYDPAFYLHHANVDRLWVIWQELQKFRGLPAYESNCALELMNEPLK PFSFG PPYNLN PI THKYSK PS EVFNYKDHFHYEY DMLEMNGMS ISQ LESY 

S.0.Hd_FUf QPELYKFAEGGKVSVLYKQVMLMFEQEDYCDFEIQLEVIHNSIHYLLGGYQKYAMSSLVFSSYDPAFYIHHSMI DRLWAIWQELQHFRKLPDKKAFCALDQMSFQMKPFHWDS — N PNAH TRSSS TPAKLF DYK-G LGYI Y DDLTFHGMS IAQ LEAA 

S.0.Hc2_FUf QQELFH LS EGGKES TLFKKAMLMFEQEDYC DFEVQFEVIHNS IHY LIGGHQKYAMSS LVYSSF DPVFYVHHSMVDRLWAIWQ DLQQYRKLPHNKAYCALDEMSF PMK PFIWES — N PN PATRAVS TPNKLF DFK-S LGYNY DNLDFHGMDTAH LEAA 

S.0.Hc3_FUf Q PELFNFTEGGKVS LLYKQVMLMFEQEDYC DFEVQ LEVIHNS IHY LLGGHQKYAMSS LVFSSF DPIFYIHHSMVDRLWAIWQELQQFRKLPYNKAYCALHQMAF PMK PFAWES — N PN PHSRSVATPDQ LF DYK-K LGYVY DDLSFHGM DIAQ LDAA 

S.0.Hc1_FUg RPQLFDDPEHGDESFFYRQIALALEQRDFCDFEIQFEMGHNAIHSWVGGPS PYGMS TLHY TSYDPLFYLHHSNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMM PFS — S DDNHNEVTKAHS TGIKSF DY-HELNYEY DNLNFHGMTI PQ LEVH 

S.0.Hc2_FUg RPQLFDDPEQGDQSFFYRQIAFALEQRDFCDFEIQFEMGHNAIHSWVGGSS PYGMS TLHY TSYDPLFYLHHSNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMM PFS — SDDNPNEVTKAHSTGTKSF DY-HELNYEY DNLNFHGMTI PQ LEVH 

S.0.Hc3_FUg RPQLFEDPEQGDQSFFYRQIAFALEQTDFCDFEIQFEIGHNAIHSWVGGSSPYGMSTLHYTSYDPLFYLHHSNTDRIWAVWQALQKYRGLPYNSANCEINKLKKKLMPFS — SEENHNVETKAHSTGFKSFDY-HQ LHY DY DNLNFHGMTI PQLDNH 

E.d.HcGFUg 



S.0.Hd_FUa LKVRKSQARAYASFRLGGIKTSANVRINICI PSK DGRHS DNC DNYAGEFFI LGGIHEMPWDFS Y PY LH 

S.o.Hc2_FUa lkgrkskarayanfrlggiktsanvriklcipskdkrqsdncdnyageffilggihempwdfaypylh 

S.o.Hc3_FUa lkqrkskarafanfrlggiktsanvrisici pnkdgrhs DNC DNYAGEFFI lggkhempwhfay pyly 

S.o.Hc1_FUb IQENELKDRVFAGFLFMGIKTSANIDFYVV AGENEFMAGSIAVLGGSKEMSWRFDRVYKY 

S.0.Hc2_FUb VEEAKSKDRVFAGFLLMGIHTSANVDLYVV aggnefsvgsiailggskemtwrfdrvykh 

S.0.Hc3_FUb I EENEAK DRVFAGF LLMGIHTSANVDIYVV AG DN EFMAGS I AV LGGSK EMSWRF DRVYKH 

S.0.Hc1_FUc INRRKSKNRVFI TFMLEGLKKS LLVEYYIK DDGS DNKMKAGEFYVLGS ENEMPWKF DRAYKS 

S.0.Hc2_FUc INRRKSKNRVFISFMLEGLKKS LLVEYYIK DDGS DNKMKAGEFYVLGS ENEMPWKF DRAYKS 

S.o.Hc3_FUc INRRKAKBRVFV TFMLEGLKKS LLVQYYIK ddasdkkmkagefyvlgsenempwkydrafky 

S.0.Hd_FUd LNKRKSH DRVFAGF LLHGIKTSADIS I YICVPT — GVNEENCGNYVGTFSALGGES EMPWNF DRLFRY 

S.0.Hc2_FUd LGKQQRH DRVFAGF LLHGIKASADVRI YICVPT — GVGEENCGNYAGIFSVLGGVTEMPWRF DRLFRY: 

S.0.Hc3_FUd LQ I RKAHDRIFVGF LLHGIKASADVRI YICVPT — GVGQ DNCGY-AGIFSI LGGTS EMPWMF DRLFRY 

S.0.Hc1_FUd' LTNNKAH DRVFVGFS LHGIKASADVRI YICV PV — GQEEKNC DHYAGIFSVLGGETEMPWQF DRLFRY 

S.0.Hc2_FUd' LESRKSHDRVFGGFWLHGIKASADVKI DICVPI — GVEH EDC DNYAG TFAVLGGETEM PWNF DRLFHY 

S.0.Hc3_FUd' LANNKAH DRVFVGFS LHGIKASADVRI YICVPT — GNGVSSC DHYAGVFSI LGGLTEM PWQF DRLYRY 

S.0.Hc1_FUe I RQ EKQK DRVFAGF LLEGFGS S AYA TFQ I C S DAGECHD-GSHFSVLGGS TEMPWAFDRLYRI 

S.0.Hc2_FUe IRQEKQKDRVFAGFLLEGFGSSATATFQIC pdsgdche-gshfsvlggsaempwafdrlykm 

S.o.Hc3_FUe irhekqkdrmfvgfllkgfgssayatfeic ddaghche-gshfsvlggsaemqwafdrlyr: 

S.o.Hc1_FUf iqrqkeddrvfagfllhgiktsadmhlkvc neadcke-agtifilggetemawhfdrnyrf 

S.0.Hc2_FUf IKKQKQK DRVFAGF LLHGIK TSA DVH LKVC NAADCHE-AGWFVLGGETEMPWHFDRNYKM 

S.0.Hc3_FUf IQKQKEN DRVFAGF LLHGIHTSADVH LKVC KETECKN-AGVLFVLGGETEMPWHFDRNYKM 

S.0.Hc1_FUg LNKIQEK DRVFAGF LLRAIGQSADVNF DIC RKDGEC-HFGGTFCVLGGQHFMAWAFDRLFLY 

S.o.Hc2_FUg lnkiqekdrvfagfllraigqsadvnfdic rkdgec-kfggtfcvlggqhemawafdrlfly 

S.o.Hc3_FUg lkkyqekdrvfagfllrtigqsadvnfdvc rkdgkc-tfggtfailggqhemfwafdrlfly 

E.d.HcG_FUg 



eitdtvnslgl plsgnyyvqaivtaingtllpdgiipnpivsyvpsvglkd 

eitdtvnslgl kldsnyyviaevtaingtlmpseviphptvtyvpprgfkd 

eitdkvhalgl s ldadyyvqaBvtaingtlmpddi i pBpivsyi pakdyh d 

El TAA IAA LGV DK YA EY TLRV DI K DVNG TA LS AN I LPS PTVI F T PGK 

EI THA LES LGV DKFA EY T LRV DI K DVNG TA L P P TAI PA PI VI FV PGH 

DI TS A LAA LGV DKFGDY TLRV DI K DVNG TA L PS TI I PA PI VI Y T PK I 

DITHVMDEMKL HYTDKYHVEYKIS DMTGAEVTDIKLE-SSWFEPGL 

DITHVMDEMKL HYTDKYHVEYKIS DM TGAEVTGIKLE-SSVIFELGL 

DI THVMDEMKL HY TDKYHI EY TVTDLKG DAVENVKL-N PTVVFQ PGF 

EI TDELMLLGL NHNSHFRISI EIMTMNRSIASSKI F PN PTI IFVPGA 

EI TNELKKLS L NQNSHFRVAMEVTAVNGSRI TQKI F PN PTI IFVPSH 

EI TDELKLLGL NHESNFHIKTWKAVNGSKINQKIFPDPTVIFV PSH 

EISHA LNALEL THKS DF TIKVEVTAANGTRINEKIF PEPTI IFAPRE 

EI TDYMNK LH L TQESKFH LTTKI IAANGTKVSN DIF PS PSWWIHGK 

EI TEELKNLGL SHKSKFEIKTEI IAVNGTI I DHKIF PEPAI IFVPKE 

EI TDI LKDMGL QFDSHFTIKVNLLAHNGTEL- TGVLTEPTVVRI PPE 

EI TDI LHDMKL NFDSHFTIKTKWAQNGTELPESI LPEATIIRI PPS 

EITKIIHDMHL QFDSHFTVKTKLVAQNGTEL-MGLLHEPTIIRVPAE 

EI TSVLEEMKI PF DKLFEHESKIHWEVEI IKVDGTPVDAGVI PK PS LI Y LPGK 
DI TDVLHEMHI PMEALFEH DSKIH LEVEIQSVDGAI LDSHS LPT PS LI YAPAK 
EI TDVLKEMKI PLQELFQHNS TIH LEVEIMKVDGTS LDASAI PK PS LI VI PGK 

DI TKA INK LH L DAY DDF LINVSIVN I EGVK L PS S LL PR PTIMFK PGKG TQHHH 

DISRTLLQLRL DAH DDF DVKVTIMGI DGKS LPTTLLPPPTI LFK PG TG TQ LTR 

DISKS LNELKL DAH DAF DLKLTI TAING DI LPNDLLPPPTI TIK PG TGNNHH 



disulfide bridge 
thioether bridge 

copper ligand histidine 
aminoacid ambiguity 
potential N-glycosylation site 
blocked N-glycosylation site 
(3-strand in E.d.HcG_FUg 
a-helix in E.d.HcG_FUg 



Figure 3 Alignment of the three cuttlefish haemocyanins (SoH1-SoH3). Highlighted are the characteristic six histidine residues forming 
the oxygen binding sites (green), one thioether bridge, three disulphide bridges and the potential glycosylation sites predicted with NetNGIyd.O. 
The secondary structure of the FUg of £ dofleini (Cuff et al., 1998) is displayed on the bottom. 



site was indeed described as an enzymatically very active 
one yet only with respect to yolk digestion [69] . A produc- 
tion of embryonic haemocyanin in the yolk sac could very 
well also be the case as copper, the metal ion complexing 
oxygen in the haemocyanin molecule, is found in the 
yolk sac during early development and is only later 
on transferred into the embryo, whereas the total amount 
of copper remains constant during embryogenesis [70]. 
Furthermore, the yolk sac may be favoured for oxygen 
uptake in the early stages because of its greater surface 
compared to the embryo [71]. 

The expression site of haemocyanin is not uniform 
among the Mollusca and ranges from the rhogocytes/ 
pore cells in the midgut gland in many gastropods [72-74] 
and Nautilus (a rather ancient cephalopod, which did not 



evolve branchial glands [63,75]), to the branchial glands in 
adult coleoid cephalopods. According to Beuerlein et al 
[49] and Ruth et al [75], the haemocyanin of cuttlefish 
embryos is also expressed in the midgut gland. This shift 
of the site of synthesis of haemocyanin during ontogeny 
may reflect an evolutionary shift from the midgut gland to 
the branchial gland in more highly evolved molluscs, 
namely recent coleoid cephalopods. The different expres- 
sion patterns of the isoforms observed before hatching 
(E4) in the complete embryo (this study) and those 
observed only in gills (see [22]) may point to tissue differ- 
ences in the expression and synthesis of the different 
isoforms at that developmental stage. Decleir et al [48] 
proposed a shift of haemocyanin expression from the 
perivitelline membrane in early embryonic stages to the 
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96.2/100 

SoH2 

0.1 

Figure 4 Maximum likelihood tree of the amino acid sequences 
of haemocyanin. The tree includes the haemocyanin sequences of 
Sepia officinalis isoform 1-3 (SoHl, SoH2, SoH3), Enteroctopus dofleini 
A-type (OdHA) and G-type (OdHG), Nautilus pompilius (NpH) and 
Haliotis tuberculata isoform 1 and 2 (HtHI, HtH2) calculated with 
PhyML using the LG + I + G model to illustrate the evolution of 
cephalopod haemocyanin. The tree topology of the Bayesian tree 
calculated with MrBayes using the WAG + I + G + F model is similar. 
Therefore, support for the tree nodes is given by the bootstrap 
values of 1,000 replicates and the consensus support (%). 



midgut gland when both tissues get in close contact and 
eventually to the branchial glands when they are fully 
developed. This was supported by Beuerlein et al [49] at 
the mRNA level and immunologically by Ruth et al [63], 
who concluded that these tissues undergo a switch of 
function in haemocyanin metabolism during ontogeny. 
However, as none of these studies distinguished between 
the different haemocyanin isoforms, the different isoforms 
cannot be assigned to specific expression sites for the 
time being. 

The sequence data indicate that SoH3, which is the 
first isoform to be expressed during ontogeny, resulted 
from the first gene duplication and could be closest 
to the common ancestor of cuttlefish haemocyanin 
(Figure 4). The isoforms SoHl and SoH2 resulted 
from a second gene duplication. SoHl, whose expression 
starts last, may represent the most derived isoform 
referring to the number of substitutions per site. As the 
differences in substitutions per site between the cuttlefish 
haemocyanin isoforms are fairly small, this assumption is 
speculative. On the other hand, molluscan haemocyanin 
generally has a nearly constant and only moderate evolu- 
tionary rate; therefore large differences between isoforms 
are not to be expected. In this line of thought, the domin- 
ating haemocyanin isoform as well as the site of synthesis 
of haemocyanin could represent a rather ancient state 
during embryogenesis. This insinuates that the ontogeny 
of cuttlefish haemocyanin, i.e. the shift of composition and 



- NpH-FUd 



pOdHA-FUd 
100/100 
— OdHG-FUd 



SoH1-FUd 



80.2/100 



60.3/98.6 



SoH2-FUd 



SoH3-FUd 



41.8/90.8 



SoH2-FUd' 



87.5/100 



99.2/100 



SoH1-FUd' 



SoH3-FUd' 



Figure 5 Maximum likelihood tree of the amino acid sequences of FUd and FUd'. The tree includes the FUd and FUd' sequences of Sepia 
officinalis (SoH1-SoH3), Enteroctopus dofleini (OdHA, OdHG) and Nautilus pompilius (NpH) and was calculated with PhyML using the WAG + I + G 
model to illustrate the duplication of FUd/d'. The tree topology of the Bayesian tree calculated with MrBayes using the WAG + G + F model is 
similar. Therefore, support for the tree nodes is given by bootstrap values of 1,000 replicates and the consensus support (%). 
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expression site [49,63], may recapitulate its phytogeny, 
which is in accordance with Haeckels biogenetic law [76]. 

Physiological aspects 

Although the timing for the completion of the final site 
of haemocyanin synthesis, the branchial glands, would 
be well in line with the start of expression of SoHl and 
SoH2, the change in expression site does not explain the 
need for a special embryonic version of the respiratory 
protein. SoH3 could additionally exhibit physiological 
properties, which might suit the embryonic situation bet- 
ter. Crustacean and mammalian embryonic respiratory 
proteins for example show different oxygen affinities 
compared to their adult equivalents to cope with the 
challenges during embryogenesis [41,43] and shifts of 
haemocyanin function and expression have also been 
discussed for gastropods [67,77]. Cuttlefish embryogenesis 
is characterised by hypoxic and hypercapnic conditions in 
the perivitelline fluid (PVF) towards the end of deve- 
lopment [44] resulting from the egg capsule acting as a 
diffusion barrier [45,46]. According to Gutowska and 
Melzner [44], the PC0 2 of the PVF in eggs of Sepia offici- 
nalis increases during embryogenesis from 0.13 kPa to 
0.41 kPa (0.04 kPa in ambient seawater), corresponding to 
a decrease in pH from 7.72 to 7.23. 

The 0 2 affinity of cephalopod haemocyanin is strongly 
pH-dependent [29] and could therefore be influenced by 
increasing carbon dioxide pressure CPC0 2 ) towards the 
end of embryogenesis, which is estimated to rise to 0.6- 

0. 8 kPa at the end of embryogenesis [44]. Furthermore, 
respiratory proteins of organisms living under hypoxia, 

1. e. the oxygen minimum zones (OMZ), express higher 
oxygen affinities in order to cope with the reduced oxygen 
availability [78,79]. The three haemocyanin isoforms of 
S. officinalis exhibit different theoretical isoelectric 
points (SoHl: 5.79, SoH2: 5.85, SoH3: 5.94) and different 
amounts of histidine residues (SoHl: 168, SoH2: 179, 
SoH3: 173) pointing at different buffer capacities and 0 2 
affinities. These could come in handy during embryogen- 
esis as abiotic conditions change, however clearly require 
further analysis before conclusions on the function of Hc3 
can be reached. 

Phylogenetic aspects 

The embryonic haemocyanin of cuttlefish already repre- 
sents the third isoform found in S. officinalis. It has the 
same genetic structure (presence of signal peptide, num- 
ber of FUs, exon/intron structure) as found in the other 
isoforms. As mentioned above, it could also be the iso- 
form most similar to the common ancestor of cuttlefish 
haemocyanin referring to the number of substitutions 
per site (Figure 4). The ambiguities that we could not 
clarify with the help of the genomic data may indicate 
SNPs between alleles. The unresolved positions observed 



in the cDNA sequence only could therefore be a result 
of pooled samples or even propose the existence of 
more than three isoforms, as fragments differing from 
all haemocyanin sequences found so far in the cuttle- 
fish were also found at the genomic level. 

Referring to the number of oxygen binding sites and 
the molecular weight of haemocyanin proteins, it has 
been suggested that Nautilus and Octopus exhibit 7 FUs 
whereas the haemocyanin of Sepia and Loligo consists of 
8 FUs [1]. As molluscan haemocyanin evolved from 
subsequent gene duplications, an additional duplication 
could have occurred within the Dibranchiata. This is 
supported by molecular data for Spirula spirula and 
Sepia officinalis SoHl, SoH2 and now also SoH3, which 
indicate a duplication of FUd/d' [11,13] (DeGeest, un- 
published, see also Figure 5) that is present in neither 
Nautilus nor Octopus [6,9]. It is rather unlikely that the 
duplication occurred independently in all four hae- 
mocyanins. Furthermore, the tree topology shown in 
Figure 5 supports the assumption that the FU duplica- 
tion was prior to the isoform duplication in cuttlefish, as 
orthologues of both FUs (e.g. SoHl_FUd', SoH2_FUd', 
SoH3_FUd') and not paralogues (e.g. SoHl_FUd, SoHl_- 
FUd') are assigned to each other. Referring to FUd, 
SoHl is more closely related to SoH2, whereas FUd' 
indicates that SoHl is more closely related to SoH3. 
These differences in topology between the two groups of 
orthologues resulting from different substitution rates 
also point to a duplication into FUd/d' much earlier than 
the duplication of the isoforms. This would support the 
assumption of FUd' being characteristic for Dibranchiata 
[13]. However, no sequence information for other groups 
within the Dibranchiata, e.g. squids, is available so far. 

In contrast to the duplication into FUd/d' the duplica- 
tion of the complete haemocyanin molecule occurred 
independently in several molluscs (Haliotis tuber culata 
[16,17], Megathura crenulata [19], Nucula nucleus [8], 
Enteroctopus dofleini [13], Sepia officinalis (DeGeest, un- 
published)). Compared to the rather recent haemocyanin 
duplication in Enteroctopus about 13 Mya, the first du- 
plication of the cuttlefish haemocyanin occurred about 
163 Mya resulting in the ancestor of SoH3 (Figure 2). 
The second duplication about 100 Mya led to the pro- 
genitor of SoHl and SoH2. These calculated divergence 
times correspond well to fossil data and previous results 
[6,12,13]. 

Conclusion 

The haemocyanin isoform 3 in cuttlefish represents an 
embryonic version of the respiratory pigment as it is 
expressed on a high level during embryogenesis, but is 
downregulated after hatching. Concerning its phy- 
siological properties only speculations based on the pi 
deduced from the primary structure can be done so far. 
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The structural organisation of SoH3 with respect to 
the exon-intron structure and the duplication of FUd/ 
FUcT reflects the situation found in the other cuttlefish 
haemocyanin isoforms. Compared to these, SoH3 prob- 
ably represents a rather ancient state with respect to 
its sequence and expression site, thereby leading to 
the assumption that phylogeny might be recapitulated 
during ontogeny. 
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